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WALSH SPACES CONTAINING SMOOTH FUNCTIONS AND 
QUASI-MONTE CARLO RULES OF ARBITRARY HIGH ORDER 

JOSEF DICK* 

Abstract. We define a Walsh space which contains all functions whose partial mixed derivatives 
up to order 8 > 1 exist and have finite variation. In particular, for a suitable choice of parameters, 
this implies that certain Sobolev spaces are contained in these Walsh spaces. For this Walsh space 
we then show that quasi-Monte Carlo rules based on digital (t, a, s)-scquences achieve the optimal 
rate of convergence of the worst-case error for numerical integration. This rate of convergence is also 
^—s , optimal for the subspace of smooth functions. Explicit constructions of digital (t, a, s)-sequences are 

given hence providing explicit quasi-Monte Carlo rules which achieve the optimal rate of convergence 
J.^ ■ of the integration error for arbitrarily smooth functions. 

^^ Key words. Numerical integration, quasi-Monte Carlo, digital nets and sequences, Walsh 

functions 
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1. Introduction. Quasi-Monte Carlo rules are quadrature rules which aim to 
approximate an integral L {) lls f(x) dx by the average of the N function values f(x n ) 
,_£^ \ at the quadrature points Xq, . . . , Xjv-i G [0, l] s (and hence are equal weight quadra- 

ture rules). The dimension s can be arbitrarily large. The task here is to find ways of 
how to choose those quadrature points in order to obtain a fast convergence of the ap- 
proximation to the integral. Explicit constructions of quadrature points in arbitrary 
high dimensions are until now available for the following two cases: 

1. for sufficiently smooth periodic functions arbitrary high convergence can be 
achieved using Kronecker sequences [TSJ Theorem 5.3] or a modification of 

(vj ■ digital nets recently introduced in [2]; 

2. a convergence of 0(iV _1 (log A^)* -1 ) can be achieved for functions of bounded 
variation (in this case the functions are not required to be periodic). 

^+ | For non-periodic functions no explicit constructions have been established which can 

fully exploit the smoothness of the integrand. This paper provides a complete solution 

C*") | to this problem. 

Among other things we show that an explicit construction of suitable point 
sets and sequences can be obtained in the following way: let d > 1 be an inte- 
ger and let xq,Xi, ... € [0, l) ds be the points of a digital (t,m,ds)-net or digital 
(£, ds)-sequence over a finite field ¥ q in dimension ds (see [20] for the definition of 
digital nets and sequences and see for example [HI [201 EH 1221 131] for explicit con- 
structions of suitable digital nets and sequences). Let x n — (x Hj i, . . . ,x n ,ds) with 
X n ,j = Xn,jsq~ x + x nt j,2q~ 2 + •" and x n ,j,i & {0, . • . , q - 1} (i.e. x n j^ are the digits 
in the base q representation of x n j). Then for n > we define y n = {y n ,i, ■ ■ • > Un,s) 
with 

oo d 

Vn-J = ^2 X! X n,(j-l)d+k,iq~ k ~ {t ~ 1)d for j = 1, • • • , S. 
i=l k=l 

(Note that the addition here is carried out in R and that the sum over i above is 
often finite as x n ,j,i — for i large enough.) We point out here that the quality of the 
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point set or sequence is directly related to the i-value of the underlying (t, to, ds)-net 
or (t, (is)-sequence, see Theorem 14.111 and Theorem 14. 121 

Corollary |5.5l now shows that quasi-Monte Carlo rules using the points y , . . . , y^-i 
(with N = q m for some to > 1) achieve the optimal rate of convergence of the integra- 
tion error of 0(N~ (log7V) 1?s ) for functions which have partial mixed derivatives up 
to order ■& which are square integrable as long as 1 < d < d (Corollary 15.51 is actually 
more general) . If -d > d no improvement of the convergence rate is obtained compared 
to functions with smoothness § = d, i.e. we obtain a convergence of (D(N~ d (log N) ds ). 
Similar, but less general results for periodic functions compared to those in this paper 
have been shown in [2] by a different proof method. (The construction above is an 
example of a construction method which can be used. In Section [4] we outline the 
general algebraical properties required for the construction of suitable point sets.) 

The quasi-Monte Carlo algorithm based on digital nets and sequences proposed 
here has also some further useful properties. For example our results also hold if one 
randomizes the point set by, say, a random digital shift (see for example [4l [51 fT7]). 
(This follows easily because the worst-case error (see Section [5J is invariant with 
respect to digital shifts in the Walsh space and hence we obtain the same upper 
bounds for randomized digital nets and sequences.) In summary the quadrature rules 
have the following properties: 

• The quadrature rules introduced in this paper are equal weight quadrature 
rules which achieve the optimal rate of convergence up to some log N factors 
and the result holds for deterministic and randomly digitally shifted quadra- 
ture rules. 

• The construction of the underlying point set is explicit and suitable point 
sets are available in arbitrary high dimensions and arbitrary high number of 
points. 

• The quadrature rules automatically adjust themselves to the optimal rate of 
convergence 0(iV~ 1 '(logiV) s '') as long as 1 < $ < d. 

• The underlying point set is extensible in the dimension as well as in the 
number of points, i.e., one can always add some coordinates or points to an 
existing point set such that the quality of the point set is preserved. 

In the following we lay out some of the underlying principles used in this work 
which stem from the behaviour of the Walsh coefficients of smooth functions. Walsh 
functions are piecewise constant wavelets which form an orthonormal set of £2([0, l] s ). 
In their simplest form, for a non- negative integer fc with base 2 representation k = kq + 
■ ■ • + K m _i2 m_1 and an x £ [0, 1) with base 2 representation x = x\l~ x + x 2 2~ 2 + ■ ■ ■ , 
the fc-th Walsh function in base 2 is given by 



walfc(:r) = (-1) 



K X!-\ hK„ 



(Later on we will use the more general definition of Walsh functions over groups.) 

The behaviour of the Fourier coefficients of smooth periodic functions is well 
known, i.e. the smoother the function the faster the Fourier coefficients go to zero 
(see for example [34]). An analogous result for Walsh functions has, to the best of the 
authors knowledge, not been known until now (see Fine [10] who, for example, shows 
that the only absolute continuous functions whose fc-th Walsh coefficients decay faster 
than 1/fc are constant functions). This will be established here and subsequently be 
exploited to obtain quasi-Monte Carlo rules with arbitrary high order of convergence. 
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To give a glimpse of how the Walsh coefficients of smooth functions behave, 
consider for example the Walsh series for 1/2 — x: 

oo oo 

1/2 - x = ^2 c fcwal fc (.T) = Y^ 2~ a ~ 2 wal 2 a (x). 

k=0 a=0 

Although the function is infinitely smooth, in general the decay of the Walsh coefficient 
is only of order 1/fc. But note that most of the Walsh coefficients are actually 0. For 
example when we consider (1/2 — x) 2 , then typically we would have that the Walsh 
coefficient of k = 2 a is of order 2~ a , the Walsh coefficient of k = 2 ai + 2 a2 (oi > o 2 ) 

is of order 2" ai_a2 and for k = 2 ai + 2° 2 -\ h 2 Q " with oi > • • • > a v and v > 2 

the fc-th Walsh coefficient would be 0. By considering (1/2 — a;) 3 , (1/2 — a;) 4 , . . ., or 
more generally polynomials, one can now realize that the speed of convergence of 
the Walsh coefficients depends on how many non-zero digits k has. This is the basic 
feature which we will relate to the speed of convergence of the Walsh coefficients for 
smooth functions. 

Subsequently we will explicitly state and use the behaviour of the Walsh coef- 
ficients of smooth functions. In general the Walsh functions depend on the base q 
digit expansion of the wavenumber k and also of the point x where the Walsh func- 
tion is to be evaluated. Hence, maybe not surprisingly, the value of the fc-th Walsh 
coefficients of smooth functions also depend on the q-adic expansion of k. We show 
that the Walsh space £ s ,q,&,~f introduced in Section [3] contains all functions whose 
partial mixed derivatives up to order 6 < i? exist and have finite variation, where $ is 
a parameter restricting the behaviour of the Walsh coefficients of the function space 
£s,q,$,i- (We use a similar, though much more general, technique as Fine [TU] used 
for showing that the Walsh coefficients of a differentiable function cannot decay faster 
than 1/fc.) 

The concept of digital (t, a, /3, to, s)-nets and digital (t, a, /3, s)-sequences (see Sec- 
tion @] and also [5] for a similar concept) is now designed to yield point sets which 
work well for the Walsh space £ s>qt # n , just in the same way as the digital nets and 
sequences from [5J HH [SU1 1231 EH] are designed to work well for the spaces for example 
considered in [H [IT] (or as lattice rules are designed to work well for periodic Korobov 
spaces). Here the power of the result that the Walsh space £ s ,q,-& n contains smooth 
functions comes into play: it follows that we can fully exploit the smoothness of an 
integrand using digital (t, a, a, to, s)-nets or digital (t, a, a, s)-sequences. As the con- 
struction of the points y , y 1 , . . . introduced at the beginning yields explicit examples 
of digital (t, a, a, to, s)-nets or digital (t, a, a, s)-sequences as shown in Section [4] we 
therefore obtain explicit constructions of quasi-Monte Carlo rules which can achieve 
the optimal order of convergence for arbitrary smooth functions. 

In the next section we introduce Walsh functions over groups and state some of 
their essential properties. 

2. Walsh functions over groups. In this section we give the definition of 
Walsh functions over groups and present some essential properties. Walsh functions 
in base 2 were first introduced by Walsh [33], though a similar but non-complete set 
of functions has already been studied by Rademacher [37] . Further important results 
were obtained in [TDJ. We follow [26] in our presentation. 

2.1. Definition of Walsh functions over groups. An essential tool for the 
investigation of digital nets are Walsh functions. A very general definition, corre- 
sponding to the most general construction of digital nets over finite rings, was given 
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in |15j . There, Walsh functions over a finite abelian group G, using some bijection ip, 
were defined. Here we restrict ourselves to the additive groups of the finite fields ¥ p r , 
p prime and r > 1. We restate the definitions for this special case here for the sake 
of convenience. In the following let N denote the set of positive integers and No the 
set of non-negative integers. 

Definition 2.1 (Walsh functions). Let q — p r , p prime, r G N and let ¥ q be the 
finite field with q elements. Let Z g = {0, 1, . . . , q — 1} C Z and let <p : Z g — > ¥ q be a 
bijection such that p(0) — 0, the neutral element of addition in ¥ q . Moreover denote 



by ip the canonical isomorphism (described below) of additive groups ip : ¥ q 



and define n : 

Wi(Xl,...,X r ) 



Ip O (p. 
Xi . 



For 1 < i < r denote by TTi the projection 7^ : 17 



Z 



Z„ 



■F„ 




Z!' 



-*-Z„ 



Let now k G No with base q representation k = kq + n\q + • • • + K m _ig m_1 where 
ki G Z g and let a; G [0, 1) with base q representation x = x\/q + X2/q 2 + • • • (unique in 
the sense that infinitely many xi must be different from q — 1). Then the fc-th Walsh 
function over the additive group of the finite field ¥ q with respect to the bijection tp 
is defined by 



F,, v wal fe (x) = exp 



2n± 
V 



1=0 »=i 



TTi °f7)(«j)(7Ti ory)(a;; + i) 



For convenience we will in the rest of the paper omit the subscript and simply write 
walfc if there is no ambiguity. 

Multivariate Walsh functions are defined by multiplication of the univariate com- 
ponents, i.e., for s > 1, x = (x±, . . . ,x s ) G [0, l) s and fc = (kx,...,k s ) G Ng, we 
set 

s 

walfe(») = ^Qwalfe^Xj). 
3=1 

We now briefly describe the canonical isomorphism. Let ¥ q = Z p [0], such that 
{1,6, ... , 6*' -1 } is a basis of ¥ q over Z p as a vector space. Then the isomorphism ijj 
between ¥ q and 1L shall be given by 



ip(x) = (xi,...,x r ) 



for a; = 2, x i 



1 »Ci t tLir)' 



For more information on the Walsh functions defined above see |26j . 

We summarize some important properties of Walsh functions over the additive 
group of a finite field which will be used throughout the paper. The proofs of the 
subsequent results can be found e.g. in [TBI [25] ( see also [I])- In the following we call 
x G [0,1) a q-adic rational if x can be represented by a finite base q expansion. 



Proposition 2.2. Let p, q, ¥ q and p be as in Definition 1 2. 1\ For x,y with 
q-adic representations x = ^2 i=w Xiq~ l and y = ^2 i=w yiq~ l , w G Z (taking w neg- 
ative, hence the following operations are also defined for integers), define x ® v y '■= 
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YnL w z iQ~ l where z i '■= <P~ 1 (<P(^i) + <p(yi)) and £>v x : = Y^ w v i1~ l w here Vi := 
<^ _1 (— tp(xi)). Further we set x Q v y := x © v (QipV)- For vectors x,y we define the 
operations component-wise. Then we have: 

1. For all k, I £ No and all x,y €E [0,1), with the restriction that ifx,y are not 
q-adic rationals then x ffi v y is not allowed to be a q-adic rational, we have 

walfc(x) • wal/(x) = wal ke ^i(x), wal k (x) ■ wal k (y) = wal k (x ® v y) 

and, with the restriction that if x, y are not q-adic rationals then x Q v y is 
not allowed to be a q-adic rational, 



wal fc (a;) ■ walj(ai) = wal keip i(x), wal fe (x) • wal fe (y) = wal k (x G v y). 
2. We have 

gwaW*) = {° * l *°> 



3. We ho 



•l 
walo(ai) da; = 1 and I walfc(x) dx = if k > 0. 



io Jo 

4- For all k,l eNq we have the following orthogonality properties: 

/ wal k (x)wali{x) dx = \ 
7[o,i) a |0 otherwise. 

5. For any f S £ 2 ([0, l) s ) and any a g [0, l) s we have 



/ f{x)dx = / f{x® v cr)dx. 

J[0,1) S "Vl) a 



6. For any integer s > 1 the system {wal*, : fe G Ng} is a complete orthonormal 
system in £ 2 ([0, l) s )- 

Remark 2.3. The restrictions in item 1. was added to exclude cases like: x — 
(0.010101 . . .) 2 , y = (0.0010101 . . .) 2 and x © y = (0.1) 2 , for which the result is of 
course not true. On the other hand, the result holds for x © y = (0.0111111 . . .) 2 . 

Throughout the paper we will use a fixed bijection ip and a fixed finite field F g is 
used for Walsh functions and © v and Q v . Hence we will often write © and instead 
of (B v and Q v . 

In the following section we will deal with Walsh series and Walsh coefficients, 
which we briefly describe in the following: functions / € £ 2 ([0, l) s ) have an associated 
Walsh series 

f( x ) ~ Yl f( k ) wal k(x), 

keN s „ 

where the Walsh coefficients f(k) are given by 

f( k ) = / /(x)walfe(a;)dx. 

For smooth functions the Walsh series converges to the function, which is shown in 
Section [ 
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3. Walsh spaces containing smooth functions. In the following we inves- 
tigate how the Walsh coefficients of smooth functions decay and subsequently we 
use this to define function classes based on Walsh functions which contain smooth 
functions. But first we introduce a suitable variation. 

3.1. A generalized weighted Hardy and Krause variation. In the follow- 
ing we generalize the Hardy and Krause variation which suits our purposes later on. 

3.1.1. Holder condition. A function / : [0, 1) — > R satisfies a Holder condition 
with coefficient < A < 1 if there is a constant Cf > such that 

|/(s) - f(y)\ < C f \x - y\ x for all x,y € [0, 1). 

The right hand side of the above inequality forms a metric on [0,1). When one 
considers the higher dimensional domain [0, l) s then \x — y\ is changed to some other 
metric on [0, l) s . Here we consider tensor product spaces and we generalize the 
Holder condition to higher dimensions in a way which is suitable for tensor product 
spaces in our context. Consider for example the function f(x) = Ylj=i fji x j)i where 
x = (xi, . . . , x s ) and each fj : [0, 1) — > K satisfies a Holder condition with coefficient 
< A < 1. Then it follows that for all ^ u C S := {1, . . . , s} we have 

n i£(*i) - wvi)\ ^ n c u n \^ - %i a (^ 

jeu jeu jeu 

for all Xj,yj € [0, 1) with j G u. But here n?=i \ x j ~ Vj\ i s n °t a metric on [0, l) s . 
Note that we have 



Y[\fj(xj) - fjiVj 



^(-l)M-MJJ Mxj) J] fj( yj ) 

vCu j£v j£u\v 



(3.2) 



which can be described in words in the following way: for given ^ u C S let 
Xj,yj G [0, 1) with Xj 7^ yj for all j G u; consider the box J with vertices {{o-j)jeu '■ 
a j = x j or a j = Vj ^ or £ u }- Then p.2p is the alternating sum of the function 
EL'su fj a ^ ^ ne vertices of J where adjacent vertices have opposite signs. This sum 
can also be defined for functions on [0, l) s which are not of product form. 

Indeed, let for a subinterval J = rL=i \ x j i Uj) vvith < Xj < yj < 1 and a function 
/ : [0, l) s — > K the function A(/, J) denote the alternating sum of / at the vertices 
of J where adjacent vertices have opposite signs. (Hence for / = YVj=i fj we have 

A(/,J) = ni=i(/i(*i)-/ J -(y J -))0 

3.1.2. Generalized Vitali variation. Let p > 1. Then we define the general- 
ized variation in the sense of Vitali with coefficient < A < 1 by 



v r A ( ,; ) (/) = S u P (^voi(j) 



A(/,J) 



Vol(J) ; 



i/p 



(3.3) 



where the supremum is extended over all partitions V of [0, l] s into subintervals and 
Vol( J) denotes the volume of the subinterval J. 

Note that for A = 1 and p = 1 one obtains the usual definition of the Vitali 
variation, see for example |20j . If we take p = oo, then we obtain a condition of the 
form (|3.1[) where u = S and where we can take the constant Ylj=i @fj = l^if/)- 
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For s = 1 and p = oo we obtain a Holder condition with coefficient < A < I. In this 
sense we can view (J3.3I) as a fractional Vitali variation of order A. 

For A = I and if the partial derivatives of / are continuous on [0, l] s we also have 
the formula 



V$(f) 



for all p > I. Indeed we have 



|A(/,J)| 




d s f 



dx\ ■ • ■ dx s 



i/p 



dec 



(3.4) 



d s f 



dxi ■ • ■ dx. 



■(x) dx 



Vol(J) 



d s f 



dx\ ■ ■ ■ dx. 



-(G) 



for some C,j G J \ which follows by applying the mean value theorem to the inequality 



mm 
xe.J 



d s f 



dx\ ■ • ■ dx s 
Therefore we have 



Jev 



(x) 



< Vol(J)" 



d s f 



Vol(J) 



A(/,J) 



Vol(J) 



dxi ■ ■ • dx. 



= ^Vol(J) 
Jev 



■(x) dx 



< max 

xe.J 



d s f 



dxi ■ • • dx, 



-(x) 



d s f 



dx\ ■ ■ ■ dx. 



-(C 



9 s f 



dxi ---Oxs 



dx and thus the equality 



which is just a Riemann sum for the integral Jj lla 
follows. 

Using Holder's inequality and the fact that (E./ e p(Vol(J) 1 " 1/p ) p/(p " 1) ) 1_1/P = 
(J2jev Vol(J)) 1 ~ 1/p = 1 it follows that 

V&k/)<V$(/) forallp>l. 

3.1.3. Generalized Hardy and Krause variation. Until now we did not take 
projections to lower dimensional faces into account (in (|3.1[) we did take projections 
into account as we considered all ^ u C S). 

For ^ u C S, let U A " (/„; u) be the generalized Vitali variation with coefficient 
< A < 1 of the | u | -dimensional function f u (x u ) — J, Q ^s-iui f(x) dx s \ u . For u = 

we have f$ — f, Q 1)s f(x) dxg and we define V^ (f$; 0) = |/0|. Let q > 1, then 



i/q 



vw/)= EK" 11 ^"))' 



,«c<s 



(3.5) 



is called the generalized Hardy and Krause variation of / on [0, 1} S . 

For A = p = q = I one obtains an unanchored version of the usual definition 
of the Hardy and Krause variation, see 20 . A function / for which Vx,p,q(f) < 00 
is said to be of finite variation with coefficient A. (We remark that in some cases it 
might be appropriate to leave out the term corresponding to u = in Q3.5p . but here 
this term will be needed later on and hence we include it already in the definition of 
the variation.) 
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3.1.4. Generalized weighted Hardy and Krause variation. As first sug- 
gested in [30] (see also [BJ) different coordinates might have different importance, 
hence we can also define a weighted variation. In the spirit of the weighted Sobolev 
spaces in [BJ, let 7 = (7 u ) u cn be an indexed set of non- negative real numbers. Then 
we define the weighted variation V\ t p t q,~f(f) of / with coefficient < A < 1 by 



Vx,m,t(/)=(E^ 1 K? I) (/«;«)) , J 

\uCS J 



i/q 



Note that for A = 1 and p = q = 2 the weighted variation Vx,p,q t y(f) coincides 
with the norm in a weighted unanchored Sobolev space for any function in this Sobolev 
space, i.e, we have the identity V\,2,2,i{f) = ||/||sob, where 



ll/IU = 



,uCS 



[0,1)I"I 



[0,1)<> 



—*Z dx s\u 



1/2 



dx u 



denotes the norm in the weighted Sobolev space (see [BJ for more information on this 
Sobolev space). 

3.2. The decay of the Walsh coefficients of smooth functions. We are now 

ready to show how the Walsh coefficients of smooth functions decay. This behaviour 
is essentially captured in Dcfinition l3.5l below. But before we get there we need several 
lemmas to prove the result. The following lemma is needed to show how the Walsh 
coefficients of functions with bounded variation decay. A simpler version of it was 
shown in |25[ Lemma 4] . 

Lemma 3.1. Let f G £i([0,l) s ) and let k = (ki,...,k s ) G N s withkj = K J q a ^ 1 + 
k'j where Qj G N, Kj G {!,..., q- I}, < fc' < q a '~ l and let < Cj < q a >~ 1 for 



J = l, 



The 



nf= 1 [<y<r a ' +I ,(o j +i)g-^ +1 ) 



<„-E ] I = i(%-i) 



f(x) walfe(x) da; 
where the supremum is taken over all boxes of the form 

» _<y+1 ,(c i + l)cr 0i+1 ) 



./ - «■-> -sup|A(/,J)| 
j 



J=Y[[d jt e j )c'[[[c j q 

3=1 j'=i 



with q aj \ej — dj\ G {1, 



!}• 



Proof. We have wal* 



wal. 



Kj q 



walfe'. and the function wal^. is constant on 



each subinterval [cjq aj+1 ,(cj + l)q a i +1 ). Hence we have 



m =1 [c 3 q-^ + \(c J + l)q-^ + ') 



f{x) walfe (a;) da; 



m=i[^? - ^ +1 ,(ci+i)?-^ +1 ) 



i /(») JJwal^-ifoJda: 



i=i 



Note that the function wal a., -1 is constant on each of the subintervals [rjq aj , (tj + 
l)q~ aj ) for rj = 0, . . . , q aj — 1 for j = 1, . . . , s. Without loss of generality we may 
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assume that Cj = 0, for all other c,- the result follows by the same arguments. Thus 
we have 



iTj=i[0,<T^ 
g— 1 a 



i /(as) J|wal Kjg a 3 -i(^)da; 



i=i 



e n wai ^^/^/ 

... — n ~ — i "M 



ri,...,r s — Oj — 1 



nj=lh'9 aj ,(r 3 +l)q "i) 



f(x)dx. 



Let now a^,...,^) = / ns _ i[rjg -a j(rj+1)g - nj) /(a:)da; and for agivenO <rx,...,r s < g 
let 

9-1 _ 
A( n ,...,r s ) = g- S E E (- 1 ) |u 'o(*«,r {1 . n „)> (3-6) 

*l,...,ta=O0^uC{l,...,s} 

where (£„, r^ ... iS }\ u ) denotes the vector obtained by setting the j-th coordinate to tj 
if j G u and Tj if j ^ w. Further let 

9-1 
B(n,...,r.)=«-' E E (- 1 ) |t, 'o(*.,r <1 a >vJ- 

tl,...,ia=0 U C{l,...,s} 

Then we have 

9—1 s 

E II Walrc i( r 'j/9)a(r 1 ,...,r s ) 

ri ,...,r s — j — 1 

q—X s 

=- e n wai ^^/^ A ( ri '---' rs ) 

ri,...,r s =0 J — 1 

g— 1 s 

ri,...,r s — j — 1 

Since X)r=o wa ^{ r /l) = an( i ^4( r ii ■ • • > r s) is a sum where each summand does not 
depend on at least one rj, i.e. the case u — is excluded in (|3.6[l . it follows that 
the first sum on the right hand side above is zero. Further we have a< ri ,...,r s ) + 
A(n,. . . ,r s ) = B(n, ...,r s ) and thus 



9—1 s 

E II Wal %( r i/«) a (ri,..,ra) 
ri,...,r s — j — 1 



9-1 

< E |£(ri,...,r,)| 

ri,...,r s — 



We have 



i?(ri,. ..,r s )\ < max 

te{o,...,g-i} s 






\M. 



^ (-l) m a (tu>r{1 s}Xu 

*C{l,...,s} 
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Therefore we have 






9-1 

* E 

r\ ,...,r s — 



max 

te{o q-i} s 



E (~ l ) lUla (t„r {1 .^) 

jC{l,...,s} 



< q s max 

r,te{0,...,8-l}" 



E (- 1 ) |u| «(t u ^ 1 =>w 

uC{l,...,s} 



Note that if in the above maximum there is a j such that rj = tj then it follows that 



E (-^V,^ .*.> 

uC{l,...,s} 

Hence we may in the following assume without loss of generality that the maximum in 
the last line of the inequality above is taken on for r = (r%, . . . , r s ) and t = (ii, . . . , t s ) 
which satisfy rj ^ tj for j = 1, . . . , s. 
We have 



E (- 1 ) |U| 0(*-,r {1 . 
«C{l,...,s} 



»}\" 



J2 (~l) M f(x + y u )dx 

nj =1 [^9 ^,(^+1)9 aj ) a c{l,...,s} 

where y u = (j/i, . . . , y s ) with yj = for j <£ u and ?/j = (tj — rj)q~ aj for j G u. We 
can write 

J2 (~l) M f(x + y u ) = A(f,J x ), 

uC{l,...,s} 

where J x = riy=i[ mm ( a 'j> a 'j + (^ — r j) c T ai )i niax(Xj, a^ + (tj — rj)q~ aj )). Therefore 
it follows that 



«C{l,...,s} 



(-1) a (t„,r {1 ,..., s}x „) 



<! -£ J- iaj sup |A(/,J X )| 



The result follows. D 

In the following lemma we now obtain a bound on the Walsh coefficients for 
functions of bounded variation. It is a generalization of |25| Proposition 6] . 

Lemma 3.2. Let < A < 1 and let f G C 2 {[0, l) s ) satisfy Va,i,i, t (/) < oo. Then 
for any k G Nq \ {0} the k-th Walsh coefficient of f satisfies 

\f(k)\ < q^- X ^e^-^f(f u ;u), 

where k = (fei, . . . , k s ), u = {1 < j < s : kj ^ 0} and for j G u we have kj = 
n j q a ^ 1 + kj, where Kj G {1, . . . , q — 1}, Qj G N and < fc'- < q a i~ 1 . 
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Proof. Let / e £ 2 ([0, l) s ) with fc-th Walsh coefficient f(k). First note that it 
suffices to show the result for k £ N s , as otherwise we only need to replace the function 
/ with the function f u (x u ) — /,„ ,w_|,,| f(x) dxs\ u - Hence let now k £ W be given 
and let k' = (k[, . . . , k' s ). Then we have 



[0,1) 



f(x) walfe(a;) da; 



< 



E 



a<Cj<q o 

1<]<B 



(c 1 + l)q-"-l + 1 r (c 3 + l)q-"s + 1 



Ciq 



f(x) walfe(a;) da; 



c B g- 



Now we use Lemma 13711 and thereby obtain that the above sum is bounded by 



0<c 1 <q a l- 1 0<c s <<j a s-l 



sup|A(/,J)|, 
j 



where the supremum is taken over all boxes J — YVj=i [4j' e i) — ll^=i[ c i'? ° i+1 1 ( c j 
l)g- a j+!) w ith q aj \e — dj\ £ {1,. . .,q — 1}. Now we have 

? - EJ^-i) sup |A(/,J)| <sup £ VoK/J^IAC/.J)! 






/ePc 



Vol(I) 



where the supremum on the right hand side is taken over all partitions V c of the cube 
Q^ =1 [cj-g~ ai+1 , (cj + l)q~ aj+1 ) and where each I £ V c is of the form I — Ylj = i[xj,yj) 
with q a '\y ] - Xj\ £ {l,...,q-l}. We have q-^=i a 3 < Vol(I) < g-^J=i(^-i) an( j 
therefore 



■E|=i(«j-i) 



< Vol(I)V < g s - A ^=i( Q J - 1 ) 



Vol(/) 1 - A 
and hence 

g-EJ-ifo-^suplAtf, J) | < g— ^EJ-iCai-lJgup ^ Vol^) 1 ^ A(/, /)|. 



/e-Pe 



Note that 



^ sup £ Vol(/) 1 - A |A(/,/)| < sup £ Vol(J) 



0<Cj<q 3 
1<J<S 



;ep c 



JeP 



|A(/,J)| 
Vol(J) A 



where the supremum on the left hand side is taken over all partitions V of the cube 
rn=i[ c j9~ aj+1 > ( c j + l)q~ aj+1 ) into subintervals and the supremum on the right hand 
side is taken over all partitions of [0, l) s into subintervals. Thus the result follows. D 
For the next lemma we will need the following two functions. For k £ {1, . . . , q— 1} 
let now 

9-1 
V K = ^2rwal K (r/q). 

r=0 
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If q is chosen to be a prime number and the bijections tp and r\ are chosen to be the 
identity, then v K = q(e 27riK /<i - l) -1 , see @J Appendix A]. 

Further for I £ {1, . . . , q — 1} we define the function ( a {x) = X)r=o wa M r /9): 
where a > 1 and x = x\q~ l + X2q~ 2 + • • • and where for x a = we set Ca(a^) = 0. The 
function £ a depends on x only through x a , thus it is a step- function which is constant 
on the intervals [cq~ a , (c + l)q~ a ) for c = 0, . . . , q a — 1. By (25j Proposition 5] it 
follows that £ a can be represented by a finite Walsh series. Indeed, there are numbers 
Co, ... , c ? _i (which depend on I but not on a) such that 

9-1 

(a(%) =^]c 2 wal 

z=0 



,= </" 



(s). 



If q is chosen to be a prime number and the bijections ip and 77 are chosen to be the 
identity, then ( a (x) = (1 - wal ig a-i(a:))(l - wal/(l/g))~ 1 , i.e., c = (1 - wal ; (l/g)) _1 , 
c; = (wal;(l/<2) — 1) _1 and c z — for z ^ 0,i. 

The following lemma will be used in the induction step for differentiable functions. 
For example, for a differentiable function F : K —¥ M. given by F(x) — L f(y) dy we 
can calculate the Walsh coefficients using integration by parts in the following way: 
for k > we have 



F(k) 



F(x)walk(x) dx 



wal k (y)dyF(x) 



f(x) I ws\ k {y)dydx, 
o Jo 



f{x) I wal fc (y)dyda; 

i> Jo 

(3.7) 



where we used J Q walfc(x) dx = J waU(x) dx — 0. For k = on other hand we obtain 



F(0)= F(x)wal (x)dx= / wal (y) dyF{x) 
f(x)dx- / f(x) I wal (y)dydx, 



f(x) / wal {y)dydx 

ii Jo 

(3.8) 



Thus if we know the Walsh series for /, then we can easily calculate the Walsh 
series for F, provided that we know the Walsh series for L walfc(y) dy. This will be 
calculated in the following lemma. It appeared in a simpler form in 10 . 

Lemma 3.3. For k e N and x E [0, 1) define Jk{x) = J* walfc(y) dy. For k > 1 
let k = lq a ~ x + kl where I G {1, . . . , q — 1}, a > 1 and < k' < q a . Then J k can be 
represented by a Walsh series which is given by 



q-l 



oo q—X 



Jk{x)=q a \ ^c z wal zg a-i +fc ,(a;) + 2 1 walfc(a;) +X)X) ' 



v K w&l Kq a+c-i +k (x) 



\z=0 

Further we have 



oo q— 1 

J (x) = 1/2 + ^2 Yl 9 _c_1 ^k wal Kg c-i (x). 

C=\ K = l 



Proof. Let k = Iq 



a-l 



k! with a > 1, < k' < q a ~ l and / e {1 



1}. The function wal;„ a -i(y) is constant on each interval [rq a ,(r + l)q a ) and 
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walfc> (y) is constant on each interval [cq a+1 ,(c+ l)q a+1 ). We have w&\ k (y) 
wal; g a-i(y) walfc'(y). For any < c < q a ~ l we have 

/ wal fc (y)dy = wal fe -(cg- a+1 ) / w&\ lq a-i (y) dy 

J[cq-°-+ 1 ,{c+l)q-o + 1 ) J[cq-»+ 1 ,(c+l) 9 -»+ 1 ) 

9-1 

= wal k ,{cq- a + 1 )q- a ^2wn\i(r/q) 

r=0 

= 0. 
Thus we have 



J k (x) =wa,l k '{x)Ji q a-i(x). 

Let x = a;i<7 _1 + X2q~ 2 + ■ ■ ■ and y = x a+ ig _1 + x a+ 2<7~ 2 H , then we have 

J lqa -i(x) = q~ a ^2 walj(r/g) + q~ a wali(x a /q)y. 

r=0 

We now investigate the Walsh series representation of the function J; g a-i (a;). First 
note that wa\i(x a /q) = wal/ g «.-i(x). Further, by a slight adaption of [4} eq. (30)] we 
obtain 

oo q—1 

y = l/2 + Y,Yl 1~ C ' lv - wal Kg c-i ( y ). (3.9) 

C— 1 K— 1 



As wal K(? c-i (y) = wal K(? a+ c -i(x) we obtain 

oc q— 1 



C— 1 K— 1 



x a -l- 



As noted above, the Walsh series of ( a (x) = J2r=o wal/(r/g), where for x a = 
we set Ca(x) = can be written as 

9-1 

Ca(x) = 2J c z wal zr -i(x). 

z=0 

Altogether we obtain 

9-1 

q a Ji q a-i (x) = J^ c 2 wal 2g a-i (x) + 2~ 1 wal /g a-i (x) 

2=0 

oo 9— 1 

+ Xm <r C ~ 1 W K Wal K9 a + c-l_ Hg a-l (X) 
C=l «=1 

and therefore 

q— 1 00 9— 1 

q a J k (x) =^2c z w&\ zq a-i +k ,(x) +2 _1 waU(a;) + J^ J^ g _c_1 u K wal Kg o+c-i +fc (x). 

2 = C=l K=l 
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The result for k = follows easily from (|3.9p . D 

Note that Lemma 13.31 can easily be generalized to arbitrary dimensions s, since 
for k = (fci, . . . , k s ) £ Ng we have for any x = (xi, . . . , X s ) £ [0, l) s that 



Jk(x) = / walfc(y) dy = TT J kj (xj), 
J[o,x) x = i 



where [0,x) = n^il ^)- 

The next lemma shows how the Walsh coefficients of a function F = J f can be 
obtained from the Walsh coefficients of /. 

Lemma 3.4. Let f £ C 2 ([0,l) s ) and let F(x) = L , /(y) dy, where [0,x) = 

Y[i=ii^T x j) with x = (xi, . . . ,X S ). Further let F(k) denote the k-th Walsh coefficient 
of F. Let k = (fci, . . . , k s ) £ Ng and let U = {1 < j < s : kj ^ 0}. For j £ U let 
kj = ljq a J~ l + kj, < lj < q and < kj < q a '~ l and further let k! = (k[, . . . , k' s ) 
where k[, — for j <£ U . Then we have 

F(k)=q-^v a * J2 (- 1 ) 1 " 1 E Hk' + (h v ,0)) X u,v,k(hv), 

UQvcs hvEfi M 

where (h v ,0) denotes the s-dimensional vector whose j-th component is hj for j £ v 
and otherwise and where for h v = (hj)j^ v £ W we set 

Xu,v,k{h v ) = Y[pk 3 -(hj) Y[ 0(M- 

jeu jev\u 

Here 

2- x \ z=h forhj^zq^- 1 , 
Pk 3 { h j)={ v z q- % - Y for h = zq a i- l+l +l jq a i-\i> 0,0 < z < q, 

otherwise, 

where l z =i = 1 for z = lj and otherwise, and 

for hj = 0, 
<j)(h 1 )={ Vzq-*- 1 forh = zq a J- 1+l ,i>0,0<z<q, 
otherwise. 

Proof. Using integration by parts in each coordinate, Fubini's theorem and 
Jfc(0) = Jfc(l) = for any k £ N (see Equations (|3.7[) and Q3.8P for one-dimensional 
examples) it follows that 

F(k)= [ F(x)w a \ k (x)dx= V (-l)M / J kv (x v )f(x)dx, 

J i°' 1 ) S UCvCS J ^ ^ S 

where for k = (ki, . . . ,k s ) and x = (xi, . . . ,X S ) we have Jh v (x v ) = Y[ j&v Jk 3 (xj)- 

Using the Walsh series expansion of J k given by Lemma 13.31 we can now express 
the Walsh coefficient F(k) as a sum of the Walsh coefficients f(h), from which the 
result follows. □ 
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The following definition now captures the essence of the decay of the Walsh co- 
efficients of smooth functions and will be used in the statement of the subsequent 
lemmas, theorems and corollaries. 

Definition 3.5. Let k = k{v;a\, . . . ,a v ) = Kiq - 1 ^ 1 + • • • + K v q av ~ 1 with v > 1. 
Ki, . . . , K v G {1, . . . , q — 1} and 1 < a v < • • • < a\ be a natural number. For k = we 
set v — 0, i.e., fc(0) = 0. A function B : No — > K is called q-adically non-increasing if 
B(k) — B(k(v; a%, . . . , a v )) is non-increasing in v and each a,i for i = 1, . . . , v, that is, 
for any v > we have 

B{k(v; a%,..., a v )) > B(k(v + l;a[,..., a' v+1 )) 

with 1 < a' v+1 < ■ ■ ■ < a[ and a\, . . . , a v S {a' 1: . . . , a' v+1 } and for an arbitrary 
1 < i < v we have 

B(k(v; a\,..., a v )) > B(k(v; oi, . . . , a;_i, a, + 1, Oj+i, . . . , o„)) 

provided that ai + 1 < a%-\ in case 1 < i < v. 

In the following lemma we give a bound on the Walsh coefficients of F if / satisfies 
some smoothness condition. 

Lemma 3.6. Let B : Ng — > [0, oo) be a q-adically non-increasing function in each 
variable. Let f e C2([0,l) s ) and let the Walsh coefficients of f satisfy 

\f{k)\<B(k) forallkeN s . 

Let F(x) = f [Qx) f{y)dy, where [Q,x) = YT j=1 [0,Xj) with x = (xi,...,X s ). 

Further let F(k) denote the k-th Walsh coefficient of F. Let k = (fei, . . . ,k s ) S 
N5 \ {0} and let U = {1 < j < s : kj ^ 0}. For j G U let kj = Ijq ^ 1 + ty and 
< k'j < q' 1 ^ 1 and further let k' = (k[, ... ,k' s ) where kj — for j <£ U. Then there 
is a constant C s ^jj > independent of k such that 

\F{k)\<C sJJ q-^v^B(k'). 



Proof. Using Lemma 13.41 we obtain that 

\F(k)\<q-^^^B(k') J2 E \Xu,v,k(h v )\, 

UQvCS hvm M 

as |/(fc' + (h v ,0))\ < B(k' + (h v ,0)) < B(fe') for all values of h v G N u| for which 
Xu,v,k(h v ) ^ 0, since B is g-adically non-increasing in each variable. 

Thus it remains to bound J2uc v cs ^h pn h \xu.v,k(h v )| independently of k. We 
only prove the case where q is chosen to be a prime number and the bijections ip and 
n are chosen to be the identity, as in this case we can obtain an explicit constant 
C s ,xj > 0. The general case can be obtained by similar arguments using the result 
from Lemma 13.41 

Using the notation from Lemma 13.41 we have 



J2 K(/»)| = |l-«^|- 1 + 2- 1 |l + a;,-''|| W ^-l|- 1 

h&N 

oo q—1 

t=l z =l 

< 3(2-2cos(2n/q))- 1/2 
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and 

oo oo q— 1 

E \<Kh)\ = 2" 1 + E ^ E ^^ - IP 1 < 2- 1 + (2 - 2 cos^/g))- 1 / 2 . 

/l = 1 = 1 2 = 1 

Therefore we have 



E E \xu,v,k( h v) 



U<ZvCS hveN lv 



< 3 |t7| (2 - 2cos(2 7 r/ (? ))-l c/ l/ 2 (3/2 + (2 - 2cos(2^/g))- 1 / 2 ) s ~l c/ l 
and hence we can choose 

C a ,v = 3 |c/| (2 - 2cos(2^/ ( ?))-l l/ l/ 2 (3/2 + (2 - 2cos(27r/ 9 ))- 1 / 2 ) s -l l/ l (3.10) 



in Lemma 13.61 for this case. D 

We use the above results now to establish an upper bound on the Walsh coeffi- 
cients of a polynomial. The proof will give a glimpse on how the argument will work 
for more general function classes. 

Lemma 3.7. Let k — Kiq" 1-1 ^ \-K v q av ~ 1 with v > 1, m, . . . , n v <e {1, . . . , <?— 1} 

and 1 < a v < • • • < a%. For v = let k = 0. Let f : [0, 1) — > R 6e </ie polynomial 
f{x) = /o + /ia; + • • • + /iX* wii/i fi^O and let f(k) denote the k-th Walsh coefficient 
of f . Then for v > there are constants < Cfj tV < oo smc/i that 



\f{k)\<C fti , v q-°--- a \ 

where we can choose Cf^ v — for v > i. 

Proof. Let f(x) = fo + fix-\ VfiX 1 , where i = deg(f) (that is, fi ^ 0). Then we 

have f^'(x) = Ufa ^ 0. As /W is a constant function, its Walsh series representation 
is simply given by f^'{x) = i\fi- Now we use Lemma l3.6l The dimension s in our case 
is 1 and we can choose the function £>i by £>i(0) = i!|/i| and for k > we set B\(k) = 0. 
Note the function B\ defined this way is a g-adically non-increasing function. Then it 
follows that there is a constant C\ > such that the Walsh coefficients of the function 
/* f {l) (t)dt = f^ l) {x) - /^-^(O) are bounded by Ciq~ a H\\fi\ for all k where t> = 1 
and the Walsh coefficients are for v > 1. The Walsh coefficient for fc = is given 

by / ( ^ 2) (l) - / ( ^ 2) (0) - / (I_1) (0)- 

Now consider the function /* f^Ht) dt = f {t ~ 2) (x) - / (l ~ 2) (0). It follows from 
the above and Lemma 13.61 that the Walsh coefficients of f( l ~ 2 \x) — /^~ 2 - ) (0) can 
be bounded by a q-adically non- increasing function £>2- Indeed there are constants 
C* 2 ,C*3 >0 such that we can choose B 2 (0) = |/ (i_2) (l)-/ (i_2) (0)-/ (i_1) (0)l. S 2 (fc) = 
C 2 q~ ai for v = l, B 2 (k) = C* 3 g- ai - a2 for w = 2 and B 2 (k) = for u > 2. Again B 2 is 
a g-adically non-increasing function and Lemma 13.61 can again be used. 

By using the above argument iteratively we obtain that there is a constant C > 

such that |/(fc)| < Cq~ ai Q ", for k = n x q ai ^ 1 -\ hK„9°" _1 with ki,...,k v € 

{1, . . . , q — 1} and 1 < a v < • ■ ■ < a\. The result thus follows. D 

For the case where q is chosen to be a prime number and the bijections tp and r\ 
are chosen to be the identity and for < v < i we can choose 

i 

C f!i , v =C v J2 cH ~ lll \fi\, (3-11) 

l=v 
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where (7 = (2 - 2 cos(2tt /q))- 1 ' 2 and C" = 3/2 + (2 - 2 cos(2 7 r/g))- 1 / 2 in Lemma GT71 
Let / : [0, l) s — > R be such that the partial mixed derivatives up to order 6 > 1 
in each variable exist and are continuous. We need some further notation: let t = 
(ti,...,t s ) and 

For x G {0, ...,(5} s let u(t) = {1 < j < s : Tj = 5}. Let 7 = (Tu^cN be an 
indexed set of non-negative real numbers. Let v(t) = {1 < j < a : Tj > 0}. Then 
for < A < 1 and p, q, t > 1 (p, q, r do not appear in the subscript of N as they do 
not have influence on our subsequent bounds, we only assume that they are bigger or 
equal to 1) we define 



i/t 



^)\)(f(-r)l 



aw/)= > 7 - ( io c;r (/ it; (-,o 5Wt) )) , (3.i2) 



where, for clarity, we introduce the additional superscript (|m(t)|) in the Hardy and 
Krause variation V^ " T { which indicates the dimension of the function and where 

for u(r) = we set V^(f^(;0 S \ u(T ))) = |/^(0)|. 

The weights 7 are introduced to modify the importance of various coordinate 
projections and were first introduced in [30] , see also [7] . If for some v' C S the 

weight 7„/ = 0, then we assume that the function / satisfies V^" ' i(/^('> Os\-u')) = 
for all r e {0, . . . , 6} s with v(r) = v' and in (|3"T2l we formally set 0/0 = 0. 
The parameters in the definition of Ns.\ n have the following meaning: 

• 8 denotes the order of partial derivatives of / required in order for Ns t x,-y{f) 
to make sense; 

• A is a Holder type parameter or fractional order type parameter of the general- 
ized Hardy and Krause variation; roughly, / needs to have partial derivatives 
up to order 6 + A, where for < A < 1 this means some type of fractional 
smoothness or in dimension one a Holder condition of order A; 

• the Vitali variation is in p norm; 

• q is the norm in the summation of the generalized Hardy and Krause variation; 

• r is the norm in the summation over the r; 

• 7 are the weights which regulate the importance of different coordinate pro- 
jections; 

Note that for A = 1 and p = q = r = 2 the functional Ns^x^-y is just the norm in a 
weighted reproducing kernel Sobolev space with continuous partial mixed derivatives 
up to order S + 1 in each variable. In one dimension the unweighted norm in this 
reproducing kernel Sobolev space is given by 

(/) 9)sob,a,n,S+l (3.13) 

= /(0)g(0) + --- + /^- 1 W A ~- 1 )(0) 

+ f f is \x)dx f g ( - s \x)dx+ f f s+1 Hx)g {S+1 \x)dx 
Jo Jo Jo 

and for higher dimensions one just takes the weighted tensor product of the one 
dimensional reproducing kernel Sobolev spaces (see [7] for examples of weighted tensor 
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product reproducing kernel Sobolev spaces). Let the s dimensional weighted inner 
product be denoted by (•, ■) S ob,a n ,s,5+i,-Y and the corresponding norm || • || so b,an, s ,5+i,7 
(indeed if the partial mixed derivatives up to order S + 1 of / are continuous on [0, l] s 

then we have ||/|| S ob,an, s ,<5+i,-y = -W<$,i,-r(/))- 

In the following we define a function /i which will be used throughout the paper: 
let 6 > 1 be an integer and < A < 1 be a real number. Then for k — (&i, . . . , k s ) we 
set 

5 

H q ,s+\{k) ='J2n qi s+\(k j ) (3.14) 

j=i 

with 

for k = 0, 

IJ>q,8+\(k) = { ai-\ V a v for v < S, (3.15) 

ai + ■ ■ ■ + as + Xas+i for v > 6 7 

where for k G N we write k = Kiq ai ~ l + • • • + Kyq^^ 1 with v > 1, ki, . . . , k v G 
{1, . . . ,<? — 1} and 1 < a v < • • • < a%. 

Theorem 3.8. Let 5 > 1 be an integer, 0<A<l,p,q,r>lfee real numbers and 
an indexed set 7 ~ (7t,)t,cN of non-negative real numbers be given. Let f : [0, l) s — > M 
be such that the partial mixed derivatives up to order S in each variable exist and such 
that Ns t \^(f) < 00. Then for any k £ Nq \ {0} it follows that there is a constant 
C/. ? . s ,-y > independent of k such that 



Proof. In order to prove the result we use the Taylor series expansion of the 
function /. We have 

/(«)= E ^/ (T) (o)+ E E ((*-i)»r w 

-re{0 : ...,<5-l} s ' 0^CS Ts \„e{O,...i-l}»-l«l 

rc^" 3 ? / f^^Hy^Os^Uix.-y^dy^ (3.16) 

lljes\u T J- J[0u,xu) jeu 

First note that the first sum in (|3.16p is a polynomial in x and therefore the 
Walsh coefficients of this polynomial satisfy the desired bound by Lemma 13.71 

Now we consider the second sum. Let 8^mC5 with u — {ji, . . . , j\ u \\ be given. 
Then for j ^ u the Walsh coefficients satisfy the desired bound by Lemma [3~7l Hence 
it remains to consider the Walsh coefficients of 

G u (x u ) = f f ls ^ s ^(y tt ,0 S \u) life ~ Vif' 1 A Vu 

^[0„,x„) jeu 

= [ f l * r " r ^ ) {V«,0s\u)l[(Xj-Vj) S + 1 *Vn 

•>[0»,1„) ieu 

where (x — y) + = max(0, x — y). 
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By differentiating the function G u in each variable < k < 6 times we obtain 
(see [33 pp. 153,154]) 

f\k\u\ 



dxl 



(5-1) 



Hence dxk G u (x u ) — if there is at least one j g u such that Xj — 0. Further we 
have 

qS\u\ 



dxi 



G u (x u ) = ((S - iy.)^f^^^(x u ,0 S \u). 



From Ns.\ n (f) < oo it follows that the Walsh coefficients of f( Su ' Ts \^(x u , Ogw) 
decay with order fi qi o + \(k) in each variable. Further we have 

G u (x u )= [ f •••/ G^\y s )dy s -.-d yi 

as the function G u and its derivatives are if at least one Xj = for j G u, i.e., we 
have 

/ G^(y)dy = J2(-V uV GL T - 1] (x v ,0 Av ) = G^K). 
J\o,x u ) ~rr. 



Hence it follows by repeated use of Lemma 13.61 that the desired bound holds for G u 
and thus the result follows from (|3.16p . D 

For the case where q is chosen to be a prime number and the bijections tp and rj 
are chosen to be the identity we can also obtain an explicit constant in Theorem 13.81 
Indeed, using Lemma T3.21 Lemma T3.6I together with the explicit constant (|3.10[) and 
Lemma 13.71 together with the explicit constant (|3.11j) we obtain that the constant 
C qn can be chosen as 



y v (T)^° 



J2 C^\^Vff\f^>^(;0 S \ u )), 

where C = 1 for 2 < q < 6 and C = (2 - 2cos(27r/<7))" 1 / 2 for q > 6 (note that for 
q > 6 we have C > 1) and 

C 8<u = 3 |n| (2 - 2cos(27r/g))-l"l/ 2 (3/2 + (2 - 2cos(2 7 r/ g ))- 1 / 2 ) ;5 -l"l. 

As noted above, under certain conditions we can write V^ " also as an integral, see 
(EH). 

We give a further useful estimation of the constant by separating the dependence 
of the function from the constants. This way we obtain 
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where 

C<5,g,s,7 = E r 7v(T)C Tl + '" +Ts 

i-e{o,...,8-i} s 

+ E ^ C U E to(r s g^^ Tj - (3.17) 

0#uC5 x SXu e{0,...,<5-l} s -l"l 

Consider now the case where the weights are of product form (see [7]), i.e., there is a 
sequence of positive real numbers ('Jj)je^ such that j v — Yljev^J ^ or au v ^- ^ an< ^ 
for u = we set j v = 1. If now q is prime with 2 < q < 6, then 



s s 

c s , q ,.„ = n^ +7i(* - 1)) + n [(! +^( (5 - !))( 3 / 2 + &)+iito& 

-n[( 1 + 7i(^-l))(3/2 + C')" . 
For example for g = 2, <5 = 1 and product weights we obtain 

s 

i=i 

The approach used here for prime q and <p the identity map can also be used for 
arbitrary prime powers q and arbitrary mappings ip with ip(0) — 0. Hence we obtain 
the following corollary. 

Corollary 3.9. Under the assumptions of Theorem \3.8\ there exists a constant 
Cs,q,s,-f > independent of k and f such that 

|/(fe)| < C Stq ,,„Ns,x,y(f)q-'*->+ xW for all k G Ng, 

where fi q ,s+x is given by \3. 1J$ and H3.15]) . 

Remark 3.10. The results in this section also hold for the following generaliza- 
tion. In the definition of Ns t \.*y we anchored the function and its derivatives at 0, 
i.e., we used Vj, " v"{ (/(•, Os\u(t)))- This can be generalized by choosing an arbitrary 

a G [0,l] s and using V^^Jj (/( - !<'5W(t))) m the definition of iV^-y. It can be 
shown that in this case we also have Theorem 13.81 and Corollary 



3.3. Convergence of the Walsh series. For our purposes here we need strong 
assumptions on the convergence of the Walsh series S(f)(x) = J2keN s /(k)walfc(a;) to 

the function /, i.e., we require that the partial series S m {f){x) = ^ fc^Ng /(fe)walfe(a;) 



"0 



converges to f(x) at every point x G [0, l) s as m — > oo. (Note that the Walsh 
series S(f) for the functions considered in this paper is always absolutely convergent, 
i.e., X)fceN s \f(k)\ < oc, hence the Walsh series S(f)(x) is uniformly bounded by 
SfcGN 3 1/(^)1 an( ^ therefore S(f)(x) itself converges at every point x G [0, l) s .) This 
is necessary as we want to approximate the integral at function values x n and for 
our analysis we deal with the Walsh series rather than the function itself, hence it is 
paramount that the function and its Walsh series coincide at every point x G [0, l) s . 
As the functions considered here are at least differentiable it follows that they 
are continuous and using the argument in JT01 p. 373] it follows that the Walsh series 
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really converges at every point x £ [0, l) s to the function value f(x). Indeed, for a 
given x £ [0, l) s we have 



V(/)(*)= E /(fe)wal fe (x) = Vol^)- 1 f f(x)dx, 

fee{0,...,? m -l}« Jj <* 

where J x — Y\ S j =1 [q^ m [q m Xj \ ,q~ m [q m Xj\ + q~ m ). The last equality follows from 
X] f{k)w&\ k (x) 

fce{0,...,g'™-l} s 

= / /(y) 52 wal fc (a;)walfc(y)dj/ 



fee{o,...,g m -i} 



'[o,i) s 

= Vol(J x )- i / /(asjdx. 
J j<* 

As the function / is continuous it immediately follows that S q ™(f){x) converges to 
f{x) as m goes to infinity and the result follows. 

3.4. A function space based on Walsh functions containing smooth 
functions. In this section we use the above results to define a function space based 
on Walsh functions which contains smooth functions for smoothness conditions con- 
sidered in the previous section. 

Let i? > 1 be a real number and q a prime power. Then for k £ N§ we set 
r q ^(k) = q-Vv.-oW^ where fi q ^ is given by (J3.14I) and (J3.15P (if $ is an integer, then 
choose A = 1 and 6 = t) — 1 and otherwise S = [-d\ and A = ■& — |_$J ) ■ 

Now we define a function space £ s q $ 7 C £2([0, l) s ) with norm || • \\g a # given 

by 



I,,,, _i \f(k u ,0 S \u 

ll/lk,,,*,, =max7„ sup ■ 



u c s 



fe„eNl"l r q^{k u ) 



\u\ 



where again for j u = 0we assume that f(k u , 0s\ u ) = for all k u £ 
The following result follows now directly from Corollary 13.91 
Corollary 3.11. Let 5 > 1, < A < 1, p,q,r > 1 and an indexed set 7 = 

(lv)vcW of non-negative real numbers be given. Then there exists a constant Cs. q . s ,~/ > 

such that for every function f : [0, l) s — > R, whose partial mixed derivatives up to 

order 5 exist, we have 

ll/lka, 9 ,«+X,-Y < C^.g.s.T^A.-yC/), 

where /j, qt s+x is given by {3. 14\ ) and 113.15]) . 

Again, using (13.17)) an explicit constant in Corollary 13.111 can be obtained for q 
prime and (p the identity map. For all other cases (i.e., arbitrary prime powers q and 
mappings ip with (p(0) = 0) explicit constants can be obtained as well, but in this 
case the constant may also depend on the particular choice of q and (p. Further, as 
noted already above, for A = 1 and p = q = r. = 2 the functional Ng^^ coincides 
with the norm in a certain Sobolev space (the one dimensional inner product for this 
Sobolev space is given by (|3.13[) and for higher dimensions one just considers tensor 
products of the one dimensional space) and hence it follows that £ a ,q,5+\,i contains 
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certain Sobolev spaces. Hence Corollary 13.111 shows that if we want to prove results 
for smooth functions it is enough to consider £ s q # ~ (in the following we design 
quasi-Monte Carlo rules which work well for £ s ,q,t9,-y rather than directly for smooth 
functions, so the results for smooth functions come as a byproduct). 

A function / G £ s , q ,-d,y can be written as a sum of their anova terms / = X)«cs /« 
(see [H]). For a function / E £ s ,q,d,-y given by f(x) — X)feeN s f(k)walk(x) the anova 
term f u corresponding to a subset u C S is simply given by 



fu(x u ) = ^2 f( k u,0 S \ u )wa\ kit (x u )- 



h u eNM 

If for some uC5we have "/ u = 0, then this implies that the anova term corresponding 
to u satisfies f u = 0. Hence the Walsh space £ s , q ,i),~ 1 consists only of functions whose 
anova term belonging to a subset u is zero for all subsets u with 7« = (see also [7]). 

4. Digital (i, a, f3, n x to, s)-nets and digital (t, a, (3, a, s)-sequences. In this 
section we give the definition of digital (t, a, ft, n x m, s)-nets and digital (t, a, /3, a, s)- 
sequences. Similar point sets were introduced in [2J. 

4.1. The digital construction scheme. The construction of the point set 
used here is a slight generalization of the digital construction scheme introduced by 
Niederreiter, see |20j . by breaking with the tradition of having square generating 
matrices. 

Definition 4.1. Let q be a prime-power and let n,m, s > 1 be integers. Let 
C\, . . . , C s be n x m matrices over the finite field ¥ q of order q. Now we construct q m 
points in [0, l) s : for < h < q m — 1 let h = ho + h\q + • • • + /i m _ig m_1 be the g-adic 
expansion of h. Consider an arbitrary but fixed bijection ip : {0, 1, . . . , q — 1} — > ¥ q . 
Identify h with the vector h — (ip(ho), . . . , ip(h m -i)) T E F™, where T means the 
transpose of the vector (note that we write h for vectors in the finite field F™ and h 

for vectors of integers or real numbers). For 1 < j '• < s multiply the matrix Cj by h, 
i.e., 



and set 



•*7, 



C j K=:(y j , 1 (h),...,y j , n (h)) T eFZ, 



q T ^ g« • 

The point set {xq, . . . , a; g ™_ 1 } is called a digital net (over ¥ q ) (with generating ma- 
trices Ci, . . . , C s ). 

For n,m — oo we obtain a sequence {a^Oj x ii • ■ •}; which is called a digital sequence 
(over Fq) (with generating matrices Ci, . . . , C s ). 

Niederreiter's concept of a digital (£, to, s)-net and a digital (£, s)-sequence will 
appear as a special case in the subsequent section. Further, the digital nets considered 
below all satisfy n > to. 

For a digital net with generating matrices C\, . . . , C s let T> = T>(Ci, . . . , C s ) be 
the dual net given by 

V = {k E Ng \ {0} : Cjkx + ■■■ + Cjk s = ()}, 
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where for k = (fc 1; . . . , k s ) with kj = Kjq + Kj %q + • • ■ and Kj j S {0, . . . , q — 1} let 
fcj = (<p(Kjfi), . . . , iy9(Kj. n _i)) T . Further, for ^ u C S let X>„ = V((Cj)j^ u ) and 

£>* =D u nNl"l. 

Note that throughout the paper Walsh functions and digital nets are defined using 
the same finite field ¥ q and the same bijection tp. 

The following lemma is a slight generalization of [25] Lemma 2.5]. 

Lemma 4.2. Let {xq, . . . , x q m_i} be a digital net over ¥ q with bijection tp, where 
ip(0) — 0, generated by the n x m matrices Ci, . . . , C s over ¥ q , n,m > 1. Then for 
any vector k = (fci, . . . , k s ) of nonnegative integers < k±, . . . , k s < q n we have 

*V^ X J> z/feGPU{0}, 

F,, v wal fc (a; ft ) = < 



/i=0 



else, 



where is t/ie zero vector in 



4.2. (t, a, /3, n x m, s)-nets and (t, a, f3, a, s)-sequences. Digital (t, a, (3, m, s)- 
nets and digital (t, a, /?, s)-sequences were first introduced in [2]. Those point sets were 
used for quasi-Monte Carlo rules which achieve the optimal rate of convergence of the 
worst-case error in Korobov spaces (which are reproducing kernel Hilbert spaces of 
smooth periodic functions). By a slight generalization of digital (t, a, /3, m, s)-nets we 
will show that those digital nets also achieve the optimal convergence of the worst-case 
error in the space £ s ,q,&,~t for all 1 < ■& < a. 

The t value of a (t, m, s)-net is a quality parameter for the distribution properties 
of the net. A low t value yields well distributed point sets and it has been shown, 
see for example [H [20] , that a small t value also guarantees a small worst-case error 
for integration in Sobolev spaces for which the partial first derivatives are square 
intcgrable. In [2] it was shown how the definition of the t value needs to be modified 
in order to obtain faster convergence rates for periodic Sobolev spaces for which the 
partial derivatives up to order 5 < /3 are square integrable. Here we extend those result 
in several ways. First we generalize the digital (t, a, ft, m, s)-nets used in [2] to digital 
(t , a, /3, n X to, s)-nets and show that we then can remove the periodicity assumption 
necessary in [2]. Further, if the derivatives up to order 5 also have bounded variation 
with coefficient < A < 1, then we have shown that such functions are in £ Sjqj s+x,y 

In the following we repeat some definitions and results from [2J and give the 
definition of digital (t, a, (3,n x m, s)-nets and digital (£, a, /?, a, s)-sequences. 

For a real number ■& > 1 the definition of the Walsh space £ s ,q,&,-y suggests to 
define the metric (i qt #(k,l) = /%#(& 1) on Ng, where Hq,&(k I) is given by (|3.14p 
and (|3.15p , which is an extension of the metric introduced in [H] , see also [2S] (the 
metric for $ = 1 can be used for Walsh spaces for example considered in 0]; for 
this case one basically obtains the metric in [19l [28]). As we will see later, in order 
to obtain a small worst-case error in the Walsh space £ s , q ,tf,-y we need digital nets 
for which mhi{(i q ^(k) : k £ £>} is large. By translating this property into a linear 
independence property of the row vectors of the generating matrices Ci, . . . , C s we 
arrive at the following definition. 

Definition 4.3. Let n, m, a > 1 be natural numbers, let < f3 < am/n be a 
real number and let < t < (3n be a natural number. Let ¥ q be the finite field of 
prime power order q and let C\, . . . ,C S 6 F™ xm with Cj = (c^i, . . . , Cj, n ) T ■ If for all 
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1 < ij V . < ■ ■ ■ < iji < n, where < i/j < to for all j = 1, . . . , s, with 

s min(iA,,a) 

J2 J2 iu <fi n -t 



the vectors 



j=i i=i 



C l,*l,v 1 I • ■ • ! ^1,U, 



are linearly independent over ¥ q then the digital net with generating matrices C\ , . . . , C s 
is called a digital (t, a,(3,n x to, s)-net over F g . Further we call a digital (f, a,(3,n x 
m, s)-net over ¥ q with the largest possible value of /3, i.e., /3 = am/n, a digital 
(£, a,n x m, s)-net over F g . 

If t is the smallest non-negative integer such that the digital net generated by 
C\, . . . , C s is a digital (£, a,j3,nx to, s)-net, then we call the digital net a strict digital 
(£, a„8,nx to, s)-net or a strict digital (£, a, n X m, s)-net if /3 = am/n. 

Remark 4.4. Using duality theory (see [52]) it follows that for a digital (t, a,/3,nx 
to, s)-net we have minfc e x> jj, q>a {k) > /3n—t and for a strict digital (£, a, f3,nx to, s)-net 
we have min^gx) IJ-q,a{k) = /3n — f + 1. Hence digital (£, a, (3,n x to, s)-nets with high 
quality have a large value of /3n — t. 

Remark 4.5. In summary the parameters t, a, /3, n, m, s have the following mean- 
ing: 

• s denotes the dimension of the point set. 

• n and to denote the size of the generating matrices for digital nets, i.e. the 
generating matrices are of size n x to; in particular this means the point set 
has q m points. 

• t denotes the quality parameter of the point set; a low t value means high 
quality. In the upper bound, t is a quality parameter related to the constant 
in the upper bound. 

• (3 is also a quality parameter. We will see later that the integration error 
is roughly q~ n . This is of course only true within boundaries, which is the 
reason for the parameter /3, i.e. the integration error is roughly q~" n . Hence 
/3 is a quality parameter related to the convergence rate. 

• a is the smoothness parameter of the point set. 
We can group the parameters also in the following way: 

• m,n,s are fixed parameters, i.e. they specify the number and size of the 
generating matrices. 

• a is a variable parameter, i.e. given (fixed) generating matrices can for 
example generate a (ti,l,/3i,10 x 5, 5)-net, a (i 2 ,2,/32,10 x 5, 5)-net, and 
so on (note the point set is always the same in each instance; the values 
t\, £2, • • • ,01,02, ■ ■ ■ may differ). This is necessary as in the upper bounds a 
will be the smoothness of the integrand, which may not be known explicitly. 

• t and j3 are dependent parameters, they will depend on the generating ma- 
trices and on a. For given generating matrices, it is desirable to know the 
values of j3 and t for each value of a G N. 

Digital (i, a, /3,n x to, s)-nets do not exist for arbitrary choices of the parameters 
t, a, /3, n, to, s, see [2]. The digital nets considered in [2] had the restriction that n = to 
and special attention was paid to those digital nets with high quality, i.e., where a = /3. 
In this paper, a special role will be played by those digital nets for which n = am 
and (3 = 1. The restriction on the linear independence of the digital nets comprises 
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now n — t = am — t row vectors, which is the same as in [2J, with the only difference 
that the size of the generating matrices is now bigger as now each generating matrix 
has n = am rows. As those digital nets play a special role in this work we have the 
following definition. 

Definition 4.6. A digital (t, a, l,am x m, s)-net over ¥ q is called a digital 
(£, a, am x to, s)-net over ¥ q . A strict digital (t, a, 1, am x m, s)-net over ¥ q is called 
a strict digital (t, a, am x to, s)-net over F g . 

Remark 4.7. For practical purposes we would like to explicitly know digital 
(t, a, am x to, s)-nets for all a, m, s > 1 with t as small as possible (as will be shown 
later, they achieve the optimal rate of convergence of the integration error of inte- 
grands for which all mixed partial derivatives of order a are, for example, square 
integrable, thus their usefulness). 

Further, for given a, m, s > 1 and a given digital (i, a, am x m, s)-net P, we would 
then also like to know the t' and f3' value of this point set P when viewed as a digital 
(t',6,/3',am x m, s)-net for all values 8 £ N, i.e., t' and /3' are functions of 6 (this 
is because we would also like to know how well such a digital net P performs if the 
integrand has partial mixed derivatives of order up to 5, because we might not know 
the smoothness of the integrand, but still would wish that P performs best possible). 

We can also define sequences of points for which the first q m points form a digital 
(t, a, f3,n x to, s)-nets. In the classical case [20] one can just consider the left-upper 
to x m submatrices of the generating matrices of a digital sequence and determine the 
net properties of these for each m £ N. Here, on the other hand, we are considering 
digital nets whose generating matrices are n X to matrices. So we would have to 
consider the left-upper n m x m submatrices of the generating matrices of the digital 
sequence for each m 6 N and where (n m ) m e® is a sequence of natural numbers. For 
our purposes here it is enough to consider only n m of the form am, for some given 
a £ N. 

Definition 4.8. Let a, a > 1 and t > be integers and let < fi < a/a be a 
real number. Let ¥ q be the finite field of prime power order q and let Ci, . . . , C s G 
pooxoo w jth Cj — (cj : i,Cj,2 7 . . .) T . Further let Cj tamxm denote the left upper am x m 
submatrix of Cj. If for all to > t/(f3a) the matrices Ci.<j„ ixm , . . . , C Si(7mxrn generate 
a digital (t, a, /?, am x m, s)-net then the digital sequence with generating matrices 
Ci, . . . , C s is called a digital (t, a, /?, a, s)-sequence over F g . Further we call a digital 
(t, a,l,a, s)-sequence over ¥ q a digital (t, a, s)-sequence over ¥ q . 

If t is the smallest non-negative integer such that the digital sequence generated 
by Ci, . . . , C s is a digital (£, a, /3, a, s)-sequence, then we call the digital sequence a 
strict digital (t, a, /3, a, s)-sequence or a strict digital (t, a, s)-sequence if a = a and 
(5 = 1. 

For short we will often write (t, a, /3,n x m, s)-net instead of digital (t, a, /3, n x 
m, s)-net over ¥ q . The same applies to the other notions defined above. 

Remark 4.9. Note that the definition of a digital (t, 1, to X to, s)-net coincides 
with the definition of a digital (t,m, s)-net and the definition of a digital (t, l,s)- 
sequence coincides with the definition of a digital (t, s)-sequence as defined by Nieder- 
reiter (5^. Further note that the t-value depends on a, (3 and a, i.e., t = t(a, /?, a) or 
t = t(a) if a — a and j3 = 1. 

The definition of (t,a, s)-sequences here differs slightly from the definition in 
[2]. Indeed the definition of a (t, a, s)-sequence in [2] corresponds to a (t,a,a, l,s)- 
sequence in the terminology of this paper, whereas here we call a (t, a, 1, a, s)-sequence 
a (t, a, s)-sequence. On the other hand note that the condition of linear independence 
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in Definition 14.31 is the same in both cases, i.e., the sum in + • • • + ii ]m i n („ 1)Q .) + • • • + 
*«,i+" ' '+*s,min(y s ,a) needs to be bounded by am—t for all m for (t, a, 1, a, s)-sequences 
and also for (£, a, a, 1, s)-sequences. 

4.3. Some properties of (t, a, ft,n x m, s)-nets and (t, a, ft, a, s)-sequences. 
The properties of such digital nets and sequences shown in [5] also hold here. For ex- 
ample it was shown there that a digital (t, a, m, s)-net is also a digital ( \ta'/a] , a', m, s)- 
net for all 1 < a' < a and every digital (t, a, s)-sequence is also a digital ( \ta'/a] , a', s)- 
sequence for all 1 < a' < a. In the same way we have the following theorem. 

Theorem 4.10. Let P be a digital (t,a,ft,n x m,s)-net over ¥ q and let S be a 
digital (t, a, ft, a, s) -sequence over ¥ q . Then we have: 

(i) P is a digital (t',a, /3',n x m, s) -net for all 1 < ft' < ft and all t < t' < ft'm 

and S is a digital (t' , a, ft' , a, s) -sequence for all 1 < ft' < ft and all t < t' . 
(ii) P is a digital (t', a' , ft' , nxm, s) -net for all I < a' < n where ft' — /3min(a, a')/ a 

and t' = \tmm(a,a')/a\ and S is a digital (t f , a' ,0' ,o~,s) -sequence for all 

a' > 1 where ft' — ft min(a, a')/a and where t' — [imin(a, a') /a] . 
(Hi) Any digital (t,a,n x m,s)-net is a digital (\ta' / a] , a' , n x m,s)-net for all 

1 < a' < a and every digital (t, a, a, s)-sequence is a digital ([ta'/a] , a', a, s)- 

sequence for all 1 < a' < a. 
(iv) IfCi, . . . ,C S £ Z^ xm are the generating matrices of a digital (t, a, ft, nxm, s)- 

net then the matrices C}™ , . . . , Cj 1 , where C consists of the first n' rows 
of Cj, generate a digital (t, a, ft, n' x m, s)-net for all 1 < n' < n . 
(v) Any digital (t, a, ft, a, s) -sequence is a digital (t, a, ft, a' ,s)- sequence for all 

1 < a' < o-. 

4.4. Constructions of it, a, ft, n x m, s)-nets and (t,a,a, s)-sequences. In 

this section we show how explicit examples of (t, a, ft,nx m, s)-nets and (t, a, ft, a, s)- 
sequences can be constructed. The idea for the construction is based on the construc- 
tion method presented in [2]. 

Let d > 1 and let C\,..., C s d be the generating matrices of a digital (t, m, sd)- 
net. Note that many explicit examples of such generating matrices are known, see 
for example [501 HS1 EH] and the references therein. For the construction of a 
(t, a, m, s)-net any of the above mentioned explicit constructions can be used, but as 
will be shown below the quality of the (t, a, m, s)-net obtained depends on the quality 
of the underlying digital (t, m, s<i)-net on which our construction is based on. 

Let Cj = (cj i, . . . , Cj. m ) T for j = 1, . . . , sd, i.e., Cjj are the row vectors of Cj. 
Now let the matrix C- be made of the first rows of the matrices C(j_i)d+i, • ■ • , Cjd, 
then the second rows of C(j_i)d+i> ■ • • , Cjd and so on. The matrix C- is then an 

dm x m matrix, i.e., C- — (c- { , . . . , c- j m ) T where c- { = c uv with I = (v — j)d + u, 
1 < v < m and (j — l)d < u < jd for 1 = 1,..., dm and j = 1, . . . , s. The following 
result is a slight generalization of [2, Theorem 3] and can be obtained using the same 
proof technique. 

Theorem 4.11. Let d > 1 be a natural number and let C\, . . . , C s d be the gener- 
ating matrices of a digital (t! , m, sd)-net over the finite field ¥ q of prime power order q. 
Let C{ , . . . ,Cs be defined as above. Then for any a > 1 the matrices C{ , . . . ,Cs 
are generating matrices of a digital (t, a, min(l, a/d), dm x m, s)-net over F q with 



t = min(a, d) t + 



s(d — 1) min(a, d) 
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The above construction and Theorem l4. 1 II can easily be extended to (t, a, (3, a, s)- 
sequences. Indeed, let d > 1 and let C\, . . . , C s d be the generating matrices of a digital 
(£, s<i)-sequence. Again many explicit generating matrices are known, see for example 
[9j [20j [23l [31]. Let Cj = {cj t \,Cj t 2,- ■ -) T f° r 3 = l,-.-,sd, i.e., Cj t i are the row 
vectors of Cj. Now let the matrix C be made of the first rows of the matrices 
C(j— i)d+ii • • ■ 7 Cjd, then the second rows of C/j_ud+ii ■ • • ■> Cjd and so on, i.e., 

r<( d ) _ i \T 

3 ~ v C 0'-l)d+l,l! ■ ■ • ' C jd,l' C (j— l)d+l,2; ■ • ■ ) c jd,2j ■ • -J ■ 

The following theorem states that the matrices C{ , . . . , Cj are the generating ma- 
trices of a digital (t, a, min(l, a/d), d, s)-sequence, compare with Theorem 4]. 

Theorem 4.12. Let d > 1 be a natural number and let C\, . . . , C s d be the gen- 
erating matrices of a digital (t' ', sd)- sequence over the finite field ¥ q of prime power 
order q. Let C{ ,..., Cs be defined as above. Then for any a > 1 the matri- 
ces C[ , . . . ,Cs are generating matrices of a digital (t, a, min(l, a/d), d, s)-sequence 
over ¥ q with 



t = min(a, d) t + 



s(d — 1) min(a, d) 



The last result shows that (t, a, j3, am x m, s)-nets indeed exist for j3 — 1 and any 
< a < a and for m arbitrarily large. We have even shown that digital (t, a, /?, am x 
to, s)-nets exist which are extensible in m and s. This can be achieved by using an 
underlying (t! , s<i)-sequence which is itself extensible in to and s. If the t' value of 
the original (£', m, s)-net or (t' , s)-sequence is known explicitly then we also know the 
t value of the digital (t, a, (3, am x to, s)-net or (t, a, f3, a, s)-sequence. Furthermore 
it has also been shown how such digital nets can be constructed in practice. Further 
results on such sequences are established in [2]. 

5. Numerical integration in the Walsh space £ S)gj # j7 . In this section we 
investigate numerical integration in the Walsh space £ 8 qd*f using quasi-Monte Carlo 
rules 



1 9-1 



Q 



where Xq, . . . , x q m_i are the points of a digital (£, a, /3, m, s)-net over F g . More pre- 
cisely, we want to approximate the integral 



W)= I /(*)daj 

./[0.11 s 



[0,1]= 

by the quasi-Monte Carlo rule Q q ™, s (f)- As a quality measure for our rule we intro- 
duce the worst-case error in the next section. 

5.1. The worst-case error in the Walsh space £ s , q ,§,-y- The worst-case error 
for the Walsh space £ s ,q,d,-f using the quasi-Monte Carlo rule Q q ™, s is given by 

e(Qq™,s,£s,q,-&,-y)= Sup |J 8 (/) - Qq™, s (f)\ . 
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The initial error is given by 

e(Qo, s , Eaatfrr) = sup \h{f)\- 

ll/lle , <i 

^ s ,q,u ,-y 

In the following we use digital nets generated by the matrices Ci, . . . , C s as 
quadrature points for the quadrature rule Q q m s . Let / G £ s ,g,i),-r Using Lemma[ 
it follows that 



\I S (f)-Q q m, 3 (f)\ 



E /» 



fce» 



<£i/»i = £ E i/>«><wi- 

fcec 0/ucsfe„ex>j 

Now we have \f(k u ,O s \ u )\ < ■y u rq^(k u )\\f\\s, iq ^^ and thus we obtain 

\W)-Qf»Af)\<\\f\\e.. t .*„ E 7« E r ^( fc -)- ( 5A ) 

By choosing f(k u , 0s\ u ) = luTq,^{k u ) for all u and fc„ we can also obtain equality in 
(f5TTj) . Thus we have 

e{Qq">,s,£s,q,0,-y) = E 7 " E r <lA k u)- (5.2) 

From the last formula we can now see that essentially a large value of mhi{fj iq ^(k) : 
k G V} guarantees a small worst-case error. Further it can be shown that 

e(Qo, s ,£s,q,d,-y) = 70- (5-3) 

We have shown the following theorem. 

Theorem 5.1. The initial error for multivariate integration in the Walsh space 
£s,q,&,~i is given by Ii5.3\) and the worst-case error for multivariate integration in the 
Walsh space £ s ,q,$*i using a digital net as quadrature points is given by Ii5.2\) . 

In the following lemma we establish an upper bound on the sum ^2 k ^ v , r q ^{k u ) 
for digital (t, a, /3,n x to, s)-nets over ¥ q . The proof is similar to [H Lemma 6]. 

Lemma 5.2. Let $ > 1 be a real number, q > 2 be a prime power, C±, . . . , C s G 
jpnxm £ e ^ e g enera n n g matrices of a digital (t, p#],/3,n x m,s)-net over ¥ q with 
< P < 1 and let £>* = 2?*((Cj)j 6tl ). For all ^ u C S we have: if $ is not an 
integer it follows that 

E r q A k u) < C\„ u Mn-t+ rtfUMW-V'LC/Jn-t)/™ 

w/iere 

C WU * = gl"ir*l ((a - ? *-W)-i + (i _ g(i-*)/W)-l«ir*l) 
and z/ 1? zs an integer it follows that 

E »**(*«) < C{ uU ,((3n t + ^)I«I^-W»-*), 
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where 

cf.i,„* = « M *(ff- 1 + (i-fl 1/ *- 1 )- M *). 

Proof. To simplify the notation we prove the result only for u = S. For all other 
subsets the result follows by the same arguments. 

We first consider the case where $ > 1 is not an integer. We partition the set T>* s 
into parts where the highest digits of kj are prescribed and we count the number of 
solutions of Cj k\ + • • • + Cj k s = 0. For j = 1, . . . , s let now ijj&\ < ■ ■ ■ < ij.i with 
ij.i > 1- Note that we now allow ij.i < 1, in which case the contributions of those 
ij.i are to be ignored. This notation is adopted in order to avoid considering many 
special cases. Further we write i s r#] = (ii,i, ■ ■ ■ , H,\&\ ■> ■ ■ ■ > *s,i> •■•■> Vrtf] ) an d define 

^S(«.,r*i) = {keV* s : kj = [Kjtf*' 1 - 1 + ■■■ + K J -, w g^r*i- 1 + 1,\ 

with < lj < g^'Toi _1 and 1 < Kjj < q for j = 1, . . . , s}, 

where ['J just means that the contributions of i^ < 1 are to be ignored. Let 
Then we have 

E ^(M=E- E -E- E ^fe# M) 

ks£T , s *i,i = l *i,r»l=l i 3 , 1 = 1 »s,["i?l = l 

Some of the sums above can be empty in which case we just set the corresponding 
summation index ijj = 0. 

Note that by the (t, \'ff},/3, nxm, s)-net property we have that |2?s(*s,r#"|)| = as 
long as ii,i+- • -+iiJ#] +• • •+« s ,i+- • •+i s ,rtfi < /3n—t. Hence let now < ii ; i, . . . , i 8i mi 
be given such that ii t \, . . . , i 8 ,i > 1, *j,[>?l < • • • < ij,i for j = 1, . . . , s and where if 
ij.i < 1 we set ijj = (in which case we also have ij,i+i = ij,i+2 = . . . = and the 

inequalities ijj > ■■• > ij.\a~\ are ignored) and ii,H MirtfiH hig.iH t-i S) r#] > 

pn — t. We now need to estimate \D* s (i s .[^])|, that is we need to count the number 
of k e T>% with kj = L^i^' 1-1 -\ h ^^i'^'r 1 + lj\. 

There are at most (q— 1)^1* choices for K ll7 . . . , K a r#-\ (we write at most because 
if ij.i < 1 then the corresponding Kj t i does not have any effect and therefore need not 
to be included). 

Let now 1 < Ki,i, . . . , K s ,[~#] < 9 be given and define 

9 = Kl,!^!,! + • • • + K l,r«?l c M lirfll + • • • + Wa.lC^i.,! + ' ' ' + «s,r^l C Zia,r^1 ' 

where we set cj, = if I < 1 or Z > n. Further let 

" = ( C l,l> • • ' > C l,t lirtfl -l) ■ • * > C s,l) • ' ■ T C s,i s ^^-l>- 

Now the task is to count the number of solutions I of Bl = g. 

As long as the columns of B are linearly independent the number of solutions can 
at most be 1. By the (t, p$] , /3, n x m, s)-net property this is certainly the case if (we 
write (x)+ = max(x, 0)) 

{h,\#} - 1)+ H 1- (ii.r^l - |"#1)+ h — 

+(* s ,r#i - !)+ h K* s ,h?i - 1^1 )+ -5 r^l(*i,rt?i h i-*sW 

< Pn-t, 
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that is, as long as 

/3n-t 

Let now ii,^] H 1- i s , p0] > r^T* ' Then by considering the rank of the matrix 

B and the dimension of the space of solutions of Bl — it follows the number of 
solutions of Bl = g is smaller or equal to q % ^-\n+---+ l s,\^}-V{P n - t )/\ f) M _ Thus we have 

PS(vr*i)l = o 

if X^=i E[=i *j,/ <Pn-t, we have 

if 5Z* =1 Ej=i *i,l > fin-t and X^=i *M*1 ^ 7^ an d finally we have 

if Ei=i E]=i *i,/ > Pn-t and £* =1 «j, ^ > ^. 

We estimate the sum (|5.4|) now. Let 5i be the sum in (|5.4[) where ii^H M s ,r#] > 

j3n-t and i i: ^ H h i s , ft?l < ^p Let Zi = iu H \- h,i#]-i H h i*,i + 

• ' • + *s,[t5]-i and let Z2 = ii,r#i + • • • + i a ,\-&] ■ Let A(l\ + I2) denote the number of 
admissible choices of ii t i, . . . , i a J#] such that h+h = 11,1 + ■ ■ • + is,[-&] ■ Then we have 

ft -(9-1) 1 2. „(tf- L tfj)z 2 Z. ^ ■ 

i 2 =0 Zi=/3n-i+l-! 2 



We have A(J X + l 2 ) < C 1 ^^ 1-1 ) an d hence we obtain 



L^frrJ 



Z 2 =0 y Ji=^n-t+l-i a V 7 

From a result by Matousek [17, Lemma 2.18], see also ;4, Lemma 6], we have 

iy ; ^ q i A s m-i 

l^/Sn-t+l-h y v ' 

- 9 ^ 401-1 

and further we have 

1 3"-* 1 
\-~rwr-i 



£ ^ wi _^_ g(rtf1 - tf)(LtfB - t)/rtf1J+1) _ 1 

^ (t(*-W)^ ^ V (tW-O-1 

i 2 =0 y z 2 =o y 
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Thus we obtain 

x - q m-*-i q \ 401-1 

- i-q*-m \ a\&\ -1 / 

Let 5*2 be the part of (|5.4[) for which i^i + • • • + i 3 [&\ > (3n — t and ii rtf] H h 

* s ,r#| > ^fyp i-e., we have 



a<(?-i)' w E ••• E 



u,r»i : 



~ Is,r ^ 1 \-L(/3«-«)/r'?lJ (7 (*i.r^+-+'' s ,r« 1 )(r'51-'<?) 

<*_^ ^_^ ff* 1 ' 1 ^ Mi,f#i-iH Ms.iH Ha, ["-fll-i 

where we have the additional conditions ii,i + • • • + i 8 mi > /3?i — £ and ii mi + ' ' " + 

i a ,\#] > HTT- As above let Zi = i\ t \ -\ V ii,\#\-\ + h i«,i + • • • + i«,fi?i-i and 

let ?2 = *i,[i?] + • • • + *s,[i5] ■ Let A(l± + I2) denote the number of admissible choices 
of ii^i, . . . , i a ,M] such that £1 + Z2 = ii,i + ■ • ■ + i«,W] ■ Note that Zi > |_$JZ2- Then we 
have j4(Zi + Z2) < ( jpjnli ) an d hence we obtain 

S 2 <(q-iy m q- [{0n ~ t)/mi 

1 (h + h + s\ff\ -1 



i 2=L ^J+i ; 1= L-.Jji 2 +i v ' 

v^ v „-ii+b-i-b* /^ + fe + ^2 - 1 + *r*i - a 

By using again Matousek [XT1 Lemma 2.18], see also [4j Lemma 6], we have 
(q l) sW E q- 1 ^- 1 - 1 ^ ( h + h + m * ~ l + ^ ~ *) 

<fl .w fa(i-*)-i^l-i + *W-A 

- 9 ? i, s rtfi-i ; 



Ji=0 

< 



and also 



5' 



rtfi-i-L03n-t)/r*u v^ o^ti-ij) f^r^l - 1 + sfi?] - 1\ 

i 2 =p n -t v ' ' 

1 q ' \ s^]-i r 



<q 
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Hence we have 



V s [t?] - 1 



9 



Note that we have J^ fc eIJ » r q .$(ks) = S\ + S 2 - Let a > 1 and 6 > be integers, 
then we have 

*:0-n( 1+ 7)*< i+ "'- 

7 »=1 



^r^i-i 



Therefore we obtain 

Si < 1 q _ ' q »_ m Wn-t + 2)'^-i g -n(/?n-«)/ WJ 
and 

s 2 < «*™ (1 - ? (i-*)/r*i)-r*i (/3n _ t + r^j.w-ig-^n-tj/w . 

Thus we have 

£ r^(fc 5 ) < C\ g ,,(/3n-* + rtflJ^l-ig-'LOfc.-tVWJ, 

where 

C,,,tf - ^1 ((« - g*"L*J J" 1 + (1 - qV-V/W )-W ). 

The result follows for the case < t? — |_$J < 1- 

Let now $ > 1 be an integer. Then using the same arguments as above it can be 
shown that 

Si < (fin - t + 2) sl5 ' q -(f>n-t)-i+s# 

and 

S 2 < q s *(l - q 1 '*- 1 )-"*^ - t + tf^-ig-O*"-*). 

Thus we have 

]T r iA*s) < Coffin -t + Wtq-W"-*, 

k s ev* s 

where 

c;^ = «- tf («- 1 + (i-g 1 /*- 1 )- tf ). 

The result now follows. D 

Remark 5.3. We note that the above lemma does not hold for (3 > 1 in general. 
Indeed, take for example u — {1}, then k u = (k\) and choose k\ — q n . Then the 
digit vector of the first n digits of q n is (0, . . . ,0) T and hence Cj k\ = and hence 
fe(i) € ^(i)- Thus 

E ^(fe(i))>9-" _1 
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and hence a counterexample can be obtained for some choices of n, /3, i9. 

In [5] we did allow /3 > 1, but therein we had the additional assumption that the 
functions are periodic. In this case we were able to show that the Walsh coefficients 
r qta (k,l) = Ylj—i r q _ a (kj, lj) of the reproducing kernel also satisfy the additional 
property that r q ^ a (q m kj,q m kj) = r q ^ a (q m kj) = q~ 2am r qya (kj,kj) for all kj,m S N, see 
[2j Lemma 15]. Similarly, if we would also assume here that r q ^(q n k) = q~ r q ^(k) 
and r q &(k) given as above if q J(k, then the above counterexample would fail as then 
r,,»(«") = r q f(lq n ) = q-* {n+l) r q ^{l). 

Using the above lemma we can now obtain an upper bound on the worst-case 
error. 

Theorem 5.4. Let •& > 1 be a real number and q > 2 be a prime power. The 
worst-case error for multivariate integration in the Walsh space £ St q$ n using a digital 
(£, \{f\,P,n x m,s)-net over ¥ q , with < < 1, as quadrature points is for non- 
integers i9 bounded by 



e(Q q m, s ,S s , q ,o„)<q-^n-tVmi J2 7«q«|,,,*G9n-<+W) |u|r * 1 " 1 . 
where 






and if $ is an integer, the worst-case error is bounded by 

e(Q qm , s ,S s , q ,^)<q-^ n -^ ]T luC' luUi) ((3n - t + d)^ , 

0#uC5 

where 

C\ u \, q ,» = ^(ff- 1 + (1 - q 1 '*- 1 )-^). 

As a direct consequence of Corollary 13. Ill we obtain the following result. 

COROLLARY 5.5. Let 8 > 1 be an integer, < A < 1 and q > 2 be a prime power. 
Then for any function f : [0, l) s — > R whose partial mixed derivatives up to order 5 
exist it follows that the integration error using a digital (t, 5 + 1, (3,n X m, s)-net over 
¥ q with < /3 < 1 as quadrature points is for < A < 1 bounded by 

\W)~Q<rM)\ < Q- {S+xmn ' mS+1)i C StS>q ,-yNs !X<y (f) 

E luC\uU,5+x{Pn -t + S+ l)l«K*+i)-i 

and for A = 1 the integration error is bounded by 

\W) - Q,»,.(/)| < q- (0n - t] Cs,s, q ,-yN s ^{f) 

E ^C[ uUS+1 ((3n-t + 5 + l)M s+ \ 

where the constant Cs, s> q,i is given in Corollaru \3.11\ and the constants Ci u \ q s+\ and 
C! , s+1 are given in Theorem \5.4\ 

Explicit constructions of digital (£, a, min(l, a/d), dm x to, s) -nets over ¥ q for 
all prime powers q, integers a,d,m,s > 1 are given in Section 14.41 By choosing 
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d = a = \t>] = 5 + 1, by Theorem 15.41 and Corollary 15.51 we obtain a convergence 
of 0(q~^ m m s ^^ +1 ), which is optimal even for the smooth functions contained in the 
Walsh space £ s ,q,d,-y, see [H] where a lower bound for smooth periodic functions was 
shown. 

Remark 5.6. In [51 Remark 4] it was noted that if m = n and j3 > a the t- value 
must grow with m and hence the restriction f3 < a was added. A similar argument 
yields in our case that the t- value must grow with n if f3n > am as Theorem 15 . 41 shows 
a convergence of 0(q~ /3n+t ) but the best possible convergence rate is q~ am , hence the 
restriction f3 < am/n was added. 

In case the smoothness of the function is not known our constructions adjust 
themselves automatically up to a certain degree in the following way: for the con- 
struction of the digital net we choose some value of d > 1 and construct a digital 
(t, a, min(l, a/d), dm x m, s) -net or a digital (£,a,min(l, a/d),d, s)-sequence for all 
a > 1. The values 5 > 1 and < A < 1 determine the real smoothness of the function, 
which we now assume is not known. The value of a is the smoothness analog for the 
digital net, i.e., we need to choose a = 5 + 1. First assume that 5 + A < d, then 
min(l, a/d) — (S + l)/d and therefore we have [3 = (6 + l)/d. As n = dm it follows 
that (3n — (5 + l)m and therefore Corollary 15.51 shows that we achieve a convergence 
of 0(q~( s+x * ,m m s ( 5+1 ) +1 ), which is optimal. Now assume on the other hand that 
5 + A > d, then min(l, a/d) = 1 and therefore /3 = 1. Again we have n = dm and 
hence /in — dm. In this case Corollary 15.51 shows that our construction achieves a 
convergence of 0(q- dm m s( - 5+1 '> +1 ). 

Note that numerical integration of functions with less smoothness, i.e., for exam- 
ple functions with partial mixed derivatives up to degree 1 in /Z2QO, l) s ) or functions 
with bounded variation, has been considered in many papers and monographs, see 
for example [3j IH [6j HH [201 1301 E] ■ Using the notation from above, basically those 
results are concerned with the case where 6 = and A = 1, hence the results here 
are a direct continuation of what was previously known. The construction of digital 
nets proposed here for d = 1 yields obviously digital (t, m, s)-nets and (£, s)-sequences 
as for example defined in 20 . In view of Corollary 15.51 and the explanation which 
followed it is hence not surprising that the classical examples and theory (see for ex- 
ample [U [H H21 [131 HU [201 E3 [31] ) only yielded a convergence of 0{q m( -- 1+£ ^) for any 
e > (the e here is used to hide the powers of m). 

Note that the worst-case error in the Walsh space £ s .q.^.~f is invariant with respect 
to a digital shift (see [4]), hence Corollary 15.51 also holds for digitally shifted digital 
nets. Thus, if one wants to use randomized digital nets, one can also use randomly 
digitally shifted digital nets. The root mean square worst-case error for this case 
would of course be bounded by the bound in Corollary 15.51 as this bound holds for 
any digital shift, i.e., our result here is even stronger in that we have shown that 
even for the worst digital shift we still have the bound of Corollary 15. 51 From this, it 
follows that for our situation here, there is, in some sense, no bad digital shift. Other 
more sophisticated scrambling methods which do not destroy the essential properties 
of the point set can be used as well (for example a digital shift of depth m, see [51IT7]). 
see [24] for some ideas in this direction. 
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